Replication Data for: THE ART OF SELF-CRITICISM: HOW AUTOCRATS PROPAGATE THEIR OWN POLITICAL SCANDALS

Charles Chang, 2023, "Replication Data for: THE ART OF SELF-CRITICISM: HOW AUTOCRATS PROPAGATE THEIR OWN POLITICAL SCANDALS", To Be Inserted, Harvard Dataverse, DRAFT VERSION, UNF:6:BKpCBBvMZeTMvup9aSBHXg== [fileUNF] 

replication_code.Rmd is the main R notebook file

database.backup is the backup file for a PostgreSQL (15.X) database that has all the data. Due to the file size limit at Harvard Dataverse, the backup file is stored at Zenodo, DOI 10.5281/zenodo.10330003. To run the .Rmd, you will need to restore this database in PostgreSQL first. 

In addition, python_files include all the Python scripts used to scrape data and to train the BERT model for classification.

train_bert_ac.py employs a BERT model to train classifiers for corruption news and predict_acnews.py uses the model trained to predict unlabelled news into corruption/non-corruption news

sina_domesticnews_getnews.py and sina_get_comments.py are two files that can collect Sina News and its comments.
